Web-based language modelling for automatic lecture transcription

نویسندگان

  • Cosmin Munteanu
  • Gerald Penn
  • Ronald Baecker
چکیده

Universities have long relied on written text to share knowledge. As more lectures are made available on-line, these must be accompanied by textual transcripts in order to provide the same access to information as textbooks. While Automatic Speech Recognition (ASR) is a cost-effective method to deliver transcriptions, its accuracy for lectures is not yet satisfactory. One approach for improving lecture ASR is to build smaller, topic-dependent Language Models (LMs) and combine them (through LM interpolation or hypothesis space combination) with general-purpose, large-vocabulary LMs. In this paper, we propose a simple solution for lecture ASR with similar or better Word Error Rate reductions (as well as topic-specific keyword identification accuracies) than combination-based approaches. Our method eliminates the need for two types of LMs by exploiting the lecture slides to collect a web corpus appropriate for modelling both the conversational and the topic-specific styles of lectures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised topic adaptation for lecture speech retrieval

We are developing a cross-media information retrieval system, in which users can view specific segments of lecture videos by submitting text queries. To produce a text index, the audio track is extracted from a lecture video and a transcription is generated by automatic speech recognition. In this paper, to improve the quality of our retrieval system, we extensively investigate the effects of a...

متن کامل

Language Model Adaptation with the Use of Presentation Slide Information for Automatic Lecture Transcription

We propose a language model adaptation method with the use of presentation slide information for automatic lecture transcription. N-gram probabilities are rescaled with lecture-dependent unigram probabilities estimated by PLSA using all slides of the lecture. In addition, the N-gram language model is interpolated with a model trained with the Web texts collected via the Web search, using keywor...

متن کامل

Automatic transcription of lecture speech using topic-independent language modeling

We approach lecture speech recognition with a topicindependent language model and its adaptation. As lecture speech has its characteristic style that is different from newspapers and conversations, dedicated language modeling is needed. The problem is that, although lectures have many keywords specific to the topic and fields, available corpus of each domain is limited in size. Thus, we introdu...

متن کامل

- 17 - Interactive Phonetics , virtually !

This paper presents a set of phonetics teaching resources as modules in a more generic framework for web-based tutoring in the areas of phonetics, multimedia communication and spoken language research. Currently the toolkit consists of standalone interactive modules and lecture notes on a number of areas of phonetics, phonology and the lexicography of spoken language. The interactive presentati...

متن کامل

Automatic Transcription of Lecture Speech using Language Model Based on Speaking-Style Transformation of Proceeding Texts

For language modeling of spontaneous speech recognition, we propose a style transformation approach, which transforms written texts to a spoken-style language model. Since these two styles are largely different and thus direct transformation is difficult, we cascade two transformation methods; rule-based transformation to rewrite written-style texts to intermediate “verbatim” texts, and statist...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007